Crucial factors affecting cooperative multirobot learning
نویسندگان
چکیده
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to learn optimal solutions for an overall multirobot system. We demonstrate that traditional single-robot learning theory can be successfully used with multirobot systems, but only under certain conditions. The success and the effectiveness of single-robot learning algorithms in multirobot systems are potentially affected by various factors that we classify into two groups: the nature of the robots and the nature of the learning. Incorrect set-up of these factors may lead to undesirable results. In this paper, we systematically test the effect of varying five common factors (model of the value function, reward scope, delay of global information, diversity of robots’ capabilities, and number of robots) in decentralized multirobot learning experiments, first in simulation and then on real robots. The results show that three of these factors (model of the value function, reward scope, and delay of global information), if set up incorrectly, can prevent robots from learning optimal, cooperative olutions. s K eywords: multirobot learning, distributed reinforcement learning, cooperative robots
منابع مشابه
Crucial Factors in Cooperative Multirobot Learning
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to learn optimal solutions for an overall multirobot system. We demonstrate that traditional single-robot learning theory can be successfully used with multirobot systems, but only under certain conditions. The success and the effectiveness of single-robot learning algorithms in multirobot systems are ...
متن کاملCrucial Factors Affecting Decentralized Multirobot Learning in an Object Manipulation Task
Decentralized multirobot learning refers to the use of multiple learning entities to achieve the optimal solution for the overall robot system. We demonstrate that single-robot learning theory can be successfully used with multirobot systems, but with certain conditions. The success and the effectiveness of this method are potentially affected by various factors that we classify into two groups...
متن کاملThe Necessity of Average Rewards in Cooperative Multirobot Learning
Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular singlerobot learning algorithms based on discounted rewards, such as Q learning, do not achieve cooperation (i.e., purposeful division of labor) when applied to task-level multirobot systems. A tasklevel system is defined as one performing a mission that is decompo...
متن کاملReward and Diversity in Multirobot Foraging
This research seeks to quantify the impact of the choice of reward function on behavioral diversity in learning robot teams The methodology developed for this work has been applied to multirobot forag ing soccer and cooperative movement This paper focuses speci cally on results in multirobot forag ing In these experiments three types of reward are used with Q learning to train a multirobot team...
متن کاملMultiple Mobi 40 . Multiple Mobile Robot Systems
Within the context of multiple mobile robot systems, this chapter explores the current state of the art. After a brief introduction, we first examine architectures for multirobot cooperation, exploring the alternative approaches that have been developed. Next, we explore communications issues and their impact on multirobot teams in Sect. 40.3, followed by a discussion of swarm robot systems in ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003